INVESTIGATION 



Mono and Dual Cofactor Dependence of Human 
Cystathionine (^-Synthase Enzyme Variants 
In Vivo and In Vitro 

Dago Dimster-Denk, 1 Katherine W. Tripp, 1 Nicholas J. Marini, Susan Marqusee, and Jasper Rine 2 

The California Institute for Quantitative Biosciences, Department of Molecular and Cellular Biology, University of 
California, Berkeley, California 94720 



ABSTRACT Any two individuals differ from each other by an average of 3 million single-nucleotide KEYWORDS 

polymorphisms. Some polymorphisms have a functional impact on cofactor-using enzymes and therefore cystathionine 

represent points of possible therapeutic intervention through elevated-cofactor remediation. Because most (3-synthase 

known disease-causing mutations affect protein stability, we evaluated how the in vivo impact caused by Saccharomyces 

single amino acid substitutions in a prototypical enzyme of this type compared with physical characteristics CYS4 

of the variant enzymes in vitro. We focused on cystathionine (3-synthase (CBS) because of its clinical vitamin B6 

relevance in homocysteine metabolism and because some variants of the enzyme are clinically responsive cysteine 

to increased levels of its B6 cofactor. Single amino-acid substitutions throughout the CBS protein caused biosynthesis 

reduced function in vivo, and a subset of these altered sensitivity to limiting B6-cofactor. Some of these B6- heme 
sensitive substitutions also had altered sensitivity to limiting heme, another CBS cofactor. Limiting heme 
resulted in reduced incorporation of heme into these variants, and subsequently increased protease sen- 
sitivity of the enzyme in vitro. We hypothesize that these alleles caused a modest, yet significant, destabi- 
lization of the native state of the protein, and that the functional impact of the amino acid substitutions 
caused by these alleles can be influenced by cofactor(s) even when the affected amino acid is distant from 
the cofactor binding site. 



The interpretation of human genome variation is a defining challenge 
in human biology for the foreseeable future. The unprecedented in- 
crease in both total sequence information and in the number of new 
gene variants has created a critical bottleneck in the ability to interpret 
the impact of the genetic variants in an individual. Clearly, advances in 
interpretation are crucial to realize the potential of personalized med- 
icine. As one measure of the magnitude of the challenge, any two 
unrelated humans differ by an average of 3 million single-nucleotide 
polymorphisms (SNPs). Among the SNPs in an individual, a current 
estimate is that 10,000 cause a change in the sequence of a protein 
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(Abecasis et al 2012; Tennessen et al 2012). Moreover, each individ- 
ual has changes in genes that have been implicated in human disease 
(Levy et al 2007). The pace of SNP discovery far outpaces the speed at 
which polymorphisms are experimentally evaluated. For mutations 
that cause disease with Mendelian inheritance due to a single amino 
acid substitution, the vast majority is believed to decrease the stability 
(free energy of folding) of the protein, leading to partial or global un- 
folding and either aggregation or degradation (Yue et al 2005). If this 
observation were to apply broadly to nonsynonymous SNPs, then ro- 
bustly predictive calculations of the energetic effects caused by these 
mutations could, in principle, make a valuable contribution to genome 
sequence interpretation. Well-characterized sets of variant forms of mul- 
tiple proteins are needed to help drive the development of such methods. 

A particular subclass of sequence variants of great interest and 
potential importance is the set of mutations whose deleterious effects 
on proteins are readily remediable by simple means. The prototype 
for such mutations were first highlighted in bacterial genetic studies 
of mutations in genes encoding certain vitamin- dependent enzymes 
that can be suppressed by increased levels of their cognate vitamins 
(Guirard et al 1971; Ames et al 2002). In addition, some human 
mutations cause clinical phenotypes sensitive to remediation by 
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increased vitamin dosages. We hypothesize that vitamin remediation 
occurs when a variant enzyme that has lost a crucial amount of free 
energy of folding can be compensated by the free energy of binding 
with the vitamin. In this case, the vitamin acts as a chemical splint, with 
the ligand-binding energy shifting the folding equilibrium and thereby 
making up for the partial loss of free energy caused by the mutation. 
Such variant proteins could be either dysfunctional, marginally func- 
tional, or substantially functional depending upon cofactor availability. 

In the human MTHFR gene, the majority of nonsynonymous 
changes in this enzyme's catalytic domain, found in a survey of non- 
clinical samples, have deleterious effects on the enzyme (Marini et al. 
2008). Moreover, for this enzyme, which participates in folate- driven, 
one-carbon metabolism, the deleterious impact of most such genetic 
substitutions can be suppressed by simply increasing the level of folate 
available to the cell. There is no crystal structure available for human 
MTHFR, thereby precluding structure-based approaches to assess the 
impact of these mutations, although phylogenetic approaches are 
promising (Marini et al. 2010). 

In the present study we turned to human cystathionine P -synthase 
(CBS), a vitamin- dependent enzyme whose structure is known, to 
explore the concept of cofactor remediation more deeply to determine 
its prevalence, and whether there are structural principles that can be 
illuminated with such alleles. In addition, well-characterized sets of alleles 
affecting a protein can serve as a test-bed for efforts such as the Critical 
Assessment of Genome Information (https://genomeinterpretation.org). 

CBS catalyzes the first step of cysteine biosynthesis via the trans- 
sulfuration pathway, which consumes homocysteine produced from 
methionine metabolism. Human CBS requires two cofactors for func- 
tion: vitamin B6 and heme. There are several biological forms of 
vitamin B6. The soluble form in vitamin supplements is pyridoxine, 
whereas the active form directly involved in CBS function is pyridoxal 
5'-phospate (PLP). Homocystinuria due to CBS deficiency (OMIM 
236200) is a recessive inborn-error of sulfur-amino acid metabolism, 
resulting in increased levels of homocysteine in the urine. More than 
140 different disease- associated mutations have been identified in the 
CBS gene (Kraus et al. 1999, 2013). Several alleles encode proteins that 
are clearly pyridoxine remediable: A 114V (pyridoxine Km variant), 
R266K, R369H, K384E, L539S, and the common I278T variant. 

The choice of a B6-dependent enzyme allowed us to test the 
generality of the observations from our studies of MTHFR variants 
and their responses to folate supplementation. The rationale for the 
selection of CBS was several-fold: (1) an in vivo assay for CBS activity 
and vitamin-responsiveness is established (Kim et al. 1997 Kruger 
and Cox 1994, 1995; Mayfield et al. 2012); (2) the literature on CBS 
mutations and disease establish this enzyme as metabolically signifi- 
cant (Meier et al. 2003); (3) clinically relevant vitamin B6-responsive 
variants provide a benchmark for validation; and (4) the crystal struc- 
ture allows for structural-based computational predictions of func- 
tional impact, including those based upon calculated free-energy-of 
folding changes (Meier et al. 2001). Clinically associated variants are 
inherently biased toward dysfunction. Therefore, for this study we 
focused our analysis on a set of designed variants with differing co- 
factor responses and examined possible conformational changes by 
in vitro measurements of thermolysin sensitivity, as recently demon- 
strated for studies of CBS (Hnizda et al. 2012a, 2010). 

MATERIALS AND METHODS 
Plasmids 

The plasmid pHUCBS was the kind gift of Warren Kruger (Kruger 
and Cox 1995). This plasmid contains the human CBS cDNA (mRNA 



reference sequence NM_000071, protein reference sequence NP_000062) 
and served as the source of the CBS coding region for all subsequent 
plasmid constructions. Polymerase chain reaction was used to amplify 
the CBS coding region and subclone the fragment into both a bacterial 
expression vector (2C-T, see Protein expression and purification below) 
as well as a yeast expression vector containing the TEF1 promoter and 
CYC1 terminator (p416-TEF; Mumberg et al. 1995). A derivative of 
this plasmid placed the hemagglutinin A epitope tag at the 3 '-end of 
the CBS coding region (pJR2858). Site-directed CBS variants were 
constructed using the QuickChange II Kit from Agilent (Santa Clara, 
CA). Random variant libraries were created using the Diversity PCR 
Random Mutagenesis Kit from Clontech (Mountain View, CA) and 
cloned into yeast expression vectors by cotransformation with gapped 
vector directly into yeast, allowing homologous recombination to con- 
struct the expression plasmids. Plasmids were rescued from yeast into 
Escherichia coli and sequenced. All mutations were constructed in the 
CBS clone containing the hemagglutinin A epitope tag. 

For all variant constructs, the entire CBS coding region was sequenced 
to ensure that a single amino acid change had been created. Each 
sequence-confirmed variant allele was then retransformed into yeast, 
and these transformants used for all subsequent analyses of phenotype 
and in vivo properties. 

Strains 

A haploid Saccharomyces cerevisiae strain containing a complete de- 
letion of the gene CYS4 y encoding yeast cystathionine |3- synthase, was 
obtained from the yeast knockout collection (Invitrogen), and served as 
the parent for all yeast strains used in this study (MATa cys4::KanMX 
his3Al leu2A0 lys2A0 ura3A0). A hemlA strain, deficient in 8-amino- 
levulinate synthase, was created by the direct transformation of the 
cys4A parent with a heml::LEU2 construct. Expression plasmids car- 
rying the human CBS major allele, or variants, were transformed into 
this strain by selecting for uracil prototrophy (Ito et al. 1983). 

Growth conditions 

We adopted the strategy of Kruger et al. for the implementation of an 
in vivo CBS assay (Kim et al. 1997; Kruger and Cox 1994, 1995) and 
was essentially as described previously (Mayfield et al. 2012). In sum- 
mary, plasmid-bearing strains were maintained on synthetic complete 
medium lacking uracil and containing glutathione (GSH), a tripeptide 
used as a source of cysteine to alleviate the need for functional CBS. 
Human CBS function was assayed based upon the growth of cys4A 
yeast strains in liquid and on solid minimal medium containing Yeast 
Nitrogen Base without vitamins (Qbiogene, Calsbad, CA) or GSH. 
Although these strains had wild- type versions of all MET genes, growth 
in minimal media also required the addition of methionine, presum- 
ably to augment a limiting intracellular concentration of homocysteine 
for the human enzyme. All vitamins except B6 (pyridoxine) were added 
back individually to levels present in standard medium. Pyridoxine was 
then added back at a range of concentrations, the "high" dose was 400 
ng/mL, the concentration of pyridoxine present in standard medium. 
Intermediate and low doses of pyridoxine were titrated in starting at 
4 ng/mL, a concentration slightly less than that required to achieve 
maximal growth. Note that even though S. cerevisiae requires PLP for 
growth and makes its own PLP, the level of PLP in wild-type yeast is 
not sufficient to satisfy the PLP requirement for human CBS when 
complementing the yeast cys4 mutation. 

For growth-rate measurements, strains containing CBS mutations 
were pregrown in minimal medium containing GSH to early stationary 
phase at 30°. Cells were diluted 1:200 in fresh medium lacking GSH, 
but with varying amounts of pyridoxine (starting cell densities were 
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■ Table 1 CBS Protein Variants Constructed and Analyzed 



Plasmid 
(pJR #) 


Nucleotide 
Mutation(s) 


Substituted 
Residue 


IxUofcrllcl \L\L\\3) 


Phenotypic 
Category 


400 t 


in/ml RA 
iy/ iii i do 


4 ng/mL B6 


2 n< 


i/ml RA 

J/ III I DO 


1 n< 


-./ml DA 

J/ III I DO 


2985 


194, A^T 


H65L 


1.2 


Nonfunctional 




0 




n/d 




n/d 




n/d 


2993 


424, AT^GC 


I142A 


3.7 


Nonfunctional 




0 




n/d 




n/d 




n/d 


2994 


425, T^A 


I142N 


9.7 


Nonfunctional 




0 




n/d 




n/d 




n/d 


2996 


460, CT^GC 


L154A 


1.8 


Nonfunctional 




0 




n/d 




n/d 




n/d 


2997 


461, T^C 


L154P 


6.7 


Nonfunctional 




0 




n/d 




n/d 




n/d 


2998 


529, A^G 


K177E 


1.2 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3001 


566, T^C 


V189A 


3.1 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3002 


620, C^G 


A207G 


2.0 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3003 


629, T^A 


L210Q 


4.2 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3005 


659, T^G 


L220R 


14.4 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3007 


684, C^A 


N228K 


5.5 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3008 


718, AT^GC 


1 240 A 


2.1 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3012 


755, T^C 


V252A 


2.6 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3013 


772, G^C 


G258R 


13.7 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3018 


829, AT^CC 


I277P 


5.1 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3023 


967, T^G 


W323G 


6.3 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3035 


1153, TTC— >CAA 


F385Q 


5.0 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3038 


1268, T^C 


L423P 


n/a 


Nonfunctional 




0 




n/d 




n/d 




n/d 


3040 


1370, G^A 


G457E 


n/a 


Nonfunctional 




0 




n/d 




n/d 




n/d 


2988 


353, T^G 


V118G 


1.9 


Sensitive 


0.91 


± 0.04 


0.81 


± 0.25 


0.49 


± 0.23 * 


0.39 


± 0.07 ** 


2991 


379, A^G 


1127V 


1.5 


Sensitive 


1.22 


± 0.11 


1.25 


± 0.07 * 


1.35 


± 0.15 


0.97 


± 0.08 


2995 


424, A^G 


1142V 


1.3 


Sensitive 


0.66 


± 0.03 ** 


0.68 


± 0.07 * 


0.53 


± 0.07 


0.38 


± 0.10 


2999 


541, CT^GC 


L181A 


3.1 


Sensitive 


0.92 


± 0.23 


0.91 


± 0.12 


0.56 


± 0.11 


0.34 


± 0.22 * 


3000 


562, A^G 


1188V 


1.0 


Sensitive 


1.19 


± 0.23 


1.41 


± 0.38 


1.42 


± 0.53 


1.93 


± 0.64 * 


3006 


674, A^G 


N225S 


1.4 


Sensitive 


0.71 


± 0.10** 


0.62 


± 0.11 ** 


0.35 


± 0.09 ** 


0.36 


± 0.07 ** 


3015 


799, A^G 


K267E 


0.9 


Sensitive 


0.68 


± 0.38 


0.67 


± 0.39 


0.47 


± 0.28 ** 


0.26 


± 0.15** 


3020 


856, AT^GC 


I286A 


1.1 


Sensitive 


0.52 


± 0.10 


0.43 


± 0.05 * 


0.18 


± 0.05 * 


0.13 


± 0.11 * 


3021 


877, C^G 


L293V 


1.1 


Sensitive 


0.57 


± 0.08 


0.50 


± 0.05 * 


0.17 


± 0.04 * 


0.25 


± 0.06 * 


3024 


1012, CT^GC 


L338A 


1.7 


Sensitive 


0.98 


± 0.11 


0.93 


± 0.17 


0.77 


± 0.13 * 


0.74 


± 0.14 


3026 


1034, T^C 


L345P 


-1.2 


Sensitive 


0.57 


± 0.04 ** 


0.49 


± 0.07 ** 


0.16 


± 0.03 ** 


0.20 


± 0.14** 


3027 


1061, T^G 


V354G 


7.5 


Sensitive 


0.67 


± 0.09 ** 


0.72 


± 0.05 ** 


0.48 


± 0.09 ** 


0.54 


± 0.10** 


3031 


1112, T^C 


V371A 


2.7 


Sensitive 


1.15 


± 0.29 


1.15 


± 0.37 


1.32 


± 0.20 * 


1.95 


± 0.54 


3036 


1153, T^C 


F385L 


5.5 


Sensitive 


0.85 


± 0.15 


0.72 


± 0.05 ** 


0.37 


± 0.07 ** 


0.37 


± 0.03 ** 


2986 


250, A^G 


1 84V 


1.0 


Benign 


0.99 


± 0.05 


1.10 


± 0.28 


1.13 


± 0.24 


1.31 


± 0.34 


2990 


370, CT^GC 


L124A 


2.3 


Benign 


0.83 


± 0.06 


1.00 


± 0.06 


0.97 


± 0.07 


1.12 


± 0.11 


2989 


370, C^G 


L124V 


2.1 


Benign 


0.76 


± 0.16 


1.10 


± 0.18 


1.09 


± 0.10 


1.62 


± 0.45 


2992 


418, G^A 


D140N 


3.0 


Benign 


1.13 


± 0.28 


0.93 


± 0.11 


0.86 


± 0.08 


0.71 


± 0.07 


3004 


640, A^G 


1214V 


1.1 


Benign 


0.89 


± 0.18 


1.16 


± 0.19 


1.07 


± 0.17 


1.11 


± 0.08 


3009 


718, A^G 


1 240V 


1.0 


Benign 


0.90 


± 0.24 


1.14 


± 0.21 


0.90 


± 0.05 


1.02 


± 0.33 


3010 


721, C^G 


L241V 


1.4 


Benign 


1.01 


± 0.13 


1.03 


± 0.19 


0.91 


± 0.19 


0.92 


± 0.41 


3011 


742, CT^GC 


L248A 


1.2 


Benign 


0.81 


± 0.08 


0.84 


± 0.11 


0.73 


± 0.14 


0.71 


± 0.08 


3014 


791, T^C 


I264T 


3.3 


Benign 


1.12 


± 0.33 


1.13 


± 0.20 


1.05 


± 0.33 


1.08 


± 0.31 


3016 


800, A^T 


K267M 


-1.8 


Benign 


1.02 


± 0.18 


1.07 


± 0.15 


1.10 


± 0.21 


1.15 


± 0.18 


3017 


811, A^G 


K271E 


0.4 


Benign 


0.87 


± 0.18 


0.89 


± 0.28 


0.80 


± 0.20 


0.87 


± 0.31 


3019 


839, T^C 


V280A 


2.5 


Benign 


1.10 


± 0.30 


1.12 


± 0.27 


1.13 


± 0.23 


1.41 


± 0.21 


3022 


931, A^G 


1311V 


0.7 


Benign 


1.03 


± 0.15 


1.33 


± 0.50 


1.12 


± 0.44 


1.41 


± 0.75 


3025 


1023, A^T 


Q341H 


-2.5 


Benign 


1.09 


± 0.23 


0.85 


± 0.20 


0.86 


± 0.19 


0.86 


± 0.10 


3028 


1067, T^C 


V356A 


1.3 


Benign 


0.98 


± 0.12 


1.04 


± 0.06 


0.94 


± 0.00 


0.92 


± 0.08 


3029 


1070, C^G 


A357G 


1.6 


Benign 


1.09 


± 0.16 


1.17 


± 0.23 


0.99 


± 0.23 


0.77 


± 0.19 


3030 


1073, T^C 


V358A 


1.6 


Benign 


0.92 


± 0.10 


0.90 


± 0.05 


0.87 


± 0.14 


0.73 


± 0.08 


3032 


1115, T^C 


V372A 


2.7 


Benign 


0.75 


± 0.39 


0.85 


± 0.48 


0.68 


± 0.38 


0.68 


± 0.37 


3033 


1120, CT^GC 


L374A 


2.3 


Benign 


1.09 


± 0.10 


1.06 


± 0.20 


1.17 


± 0.24 


1.28 


± 0.22 


3034 


1147, A^T 


T383S 


1.1 


Benign 


0.95 


± 0.15 


0.87 


± 0.14 


0.70 


± 0.10 


0.83 


± 0.22 


3037 


1223, G^T 


W408L 


n/a 


Benign 


1.04 


± 0.03 


1.13 


± 0.17 


1.19 


± 0.16 


1.08 


± 0.10 


3039 


1298, A^T 


H433L 


n/a 


Benign 


0.98 


± 0.22 


0.93 


± 0.30 


0.83 


± 0.21 


0.72 


± 0.13 


3041 


1468, A^C 


I490L 


n/a 


Benign 


0.86 


± 0.49 


0.74 


± 0.42 


0.74 


± 0.43 


0.87 


± 0.49 


3042 


1646, A^G 


D549G 


n/a 


Benign 


0.95 


± 0.13 


0.80 


± 0.11 


0.69 


± 0.10 


0.65 


± 0.12 


3043 


1675, T^C 


Y559H 


n/a 


Benign 


1.36 


± 0.10 


1.18 


± 0.32 


1.15 


± 0.35 


1.07 


± 0.37 



The 58 single amino-acid substitutions constructed within the human CBS coding region are listed. The engineered nucleotide change and the resulting change in 
amino acid residue are shown (reference sequence NM_000071). The calculated change in free energy based on the Rosetta prediction algorithm for each 
substitution is also listed (AAG, kcal/mol), with the exception of seven substitutions located in the N-terminal portion of the protein that is not part of the structure and 
therefore no predictions were available (indicated as n/a). All of the protein variants were assigned to one of three phenotypic categories based on empirical 
measurements of growth: benign, sensitive, or nonfunctional. The relative growth rates of yeast containing 39 single-amino-acid human CBS substitutions with some 
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OD595 - 0-01 in a volume of 150 jjlL). Cell number was measured every 
30 min for 72—96 hr in a Tecan GENios plate reader at 27—28° 
without shaking. For growth-rate calculations, raw OD 595 values were 
normalized to an endpoint read after the cells were resuspended. The 
normalized OD 595 values were then log 10 transformed, and the growth 
rate determined from the slopes calculated between two fixed cell 
densities (0.05-0.1 for HEM1 cells, and 0.1-0.2 for hemlA cells). 
A plate normalization factor was applied to account for plate-to-plate 
variations, and the variant growth rates expressed as a ratio relative to 
the growth rate resulting from the major allele. 

Thermodynamic predictions of the impact of single 
amino acid variations 

The Rosetta computational model for predicting the structural effect 
of single amino acid substitutions involves a process whereby the side- 
chain rotamers are sampled while keeping the protein backbone fixed 
(Bonneau et al. 2002; Das and Baker 2008; Kellogg et al. 2011; Rohl 
et al. 2004). The calculated energy function consists of an all-atom 
description including physical and statistical terms. Results from ear- 
lier work established a benchmark to assess the predictions from this 
project. In that work, 985 point mutations in 77 different protein 
structures were examined. The Pearson correlation coefficient between 
the predicted and the experimental assessment of function was 0.64 
for the entire set (D. Baker and P. Bradley, unpublished data). 

Protein expression and purification 

A bacterial expression clone containing a 6His-MBP-CBS fusion was 
obtained from Macrolab at UC Berkeley (2C-T vector; http://www. 
addgene.org/29706/). Substitutions within the CBS coding region were 
then constructed using site- directed mutagenesis and confirmed by 
DNA sequencing, as was done with the yeast CBS expression con- 
structs. MBP-CBS fusions were expressed in Rosetta 2 (DE3) cells 
(Novagen; www.emdmillipore.com). Cells were grown in Luria broth 
with 200 u>g/mL ampicillin at 37° to an OD 600 between 0.6 and 0.8, 
and then the temperature was shifted to 20° and the cultures were 
allowed to grow for an additional hour. Protein expression was then 
induced by the addition of 1 mM IPTG (Gold Biotechnology, St. 
Louis, MO) and either 0.3 or 0.6 mM 8 -ALA ( Sigma- Aldrich, St. 
Louis, MO) and the cells grown at 20° for an additional 15—20 hr. 

Cells were harvested by centrifugation and resuspended in 50 mL 
of lysis buffer (50 mM Tris, pH 8; 300 mM NaCl; 0.5 mM TCEP; 50 
|jlM pyridoxine-HCl; Sigma- Aldrich) with a cocktail of protease inhib- 
itors (Roche Complete, EDTA-free). The cells were then lysed by 
sonication, and the supernatant fraction collected by centrifugation. 
Proteins were purified from the lysis supernatant using a Ni-NTA 
affinity resin (QIAGEN, Valencia, CA), eluted from the column in 
lysis buffer with 250 mM imidazole, and then dialyzed into lysis buffer 
to remove the imidazole and concentrate the purified protein. Protein 
concentrations were determined by measuring the absorbance of the 
aromatic side chains (Edelhoch 1967). 

Absorbance spectra 

Absorbance spectra were taken on purified proteins ranging in 
concentration from 20 to 100 mM and then normalized to protein 
concentration for comparison. The ratio of the heme 430 nm absorbance 



peak and the aromatic 280 nm absorbance peak was one-to-one, 
indicating nearly 100% heme incorporation in the major allele protein 
(Majtan et al. 2010). 

Proteolysis kinetics 

Protease susceptibility of purified proteins was measured using 
thermolysin. The purified protein sample was diluted to approxi- 
mately 10 mM in lysis buffer. The reaction was initiated by the 
addition of thermolysin at a final concentration of 10 |jLg/mL at room 
temperature. At defined time intervals a portion of the reaction was 
removed, and the protease digestion quenched by the addition of 
ethylenediamine tetraacetic acid at a final concentration of 12.5 
mM. Samples were analyzed by 8% sodium dodecyl sulfate polyacryl- 
amide gel electrophoresis. 

The MBP-CBS protein fusion, as well as the cleaved CBS and MBP 
fragments, were excised from the gel and their identity confirmed by 
mass spectrometry by the following procedure. A nano LC column 
packed with 10 cm of Polaris cl8 5 |jim packing material (Varian) was 
loaded with the protein sample by use of a pressure bomb and washed 
extensively with buffer A. The column was then directly coupled to an 
electrospray ionization source mounted on a Thermo- Fisher LTQ XL 
linear ion trap mass spectrometer. An Agilent 1200 HPLC equipped 
with a split line, so as to deliver a flow rate of 30 nL/min, was used for 
chromatography. Peptides were eluted with a 90-min gradient from 
100% buffer A to 60% buffer B. Buffer A was 5% acetonitrile/0.02% 
heptaflurobutyric acid; buffer B was 80% acetonitrile/ 0.02% hepta- 
flurobutyric acid. The programs SEQUEST and DTASELECT were 
used to identify peptides and proteins from a database consisting of 
common contaminants and the engineered sequences of the proteins 
(Eng et al 1994; Tabb et al 2002). 

RESULTS 

CBS-substitution yeast growth phenotypes 

We used a cell-based assay to assess the impact of single amino acid 
substitutions upon the function of the human CBS enzyme. A cDNA 
clone of the major allele of the human CBS gene, or variant forms of 
the gene, was expressed in S. cerevisiae and tested for functional com- 
plementation of a null allele of the orthologous yeast gene, CYS4. This 
strategy has several desirable qualities: (1) expression of the human 
clone can be driven by a heterologous promoter and terminator so as 
to minimize transcriptional effects on protein levels; (2) the expression 
level of the complementing human clone can be modulated so as to 
restore growth but remain limiting, thereby providing a sensitive assay 
for both partial loss-of-function alleles or enhanced- function alleles; (3) 
defined growth media allow for the titration of exogenous pyridoxine 
or bypass of CBS function entirely by the addition of cysteine in the 
form of GSH to the growth medium; and (4) the yeast genome can be 
engineered so as to alter other intermediates of the metabolic pathway. 
This assay was used to rank order variant activities as well as vitamin, 
or heme, responsiveness. We recently used this assay to catalog the 
functional impact of the majority of CBS polymorphisms identified 
clinically (Mayfield et al 2012). 

The focus of this study was to examine the effect of single amino 
acid substitutions on CBS function and the associated impact of 
cofactor availability in a more direct fashion. Two strategies were used 



degree of function are listed. Growth rates are expressed as a ratio relative to the yeast strain expressing the major human CBS allele, grown with the same amount of 
exogenous pyridoxine supplementation (either 400, 4, 2, or 1 ng/rmL pyridoxine), with the SD indicated (determined from at least three independent yeast trans- 
formants). Growth rates that differed significantly from the major allele in the same media conditions are indicated either by a single asterisk (P < 0.01) or a double 
asterisk (P < 0.001). Growth rates that were not determined are indicated (n/d). CBS, cystathionine ^-synthase. 
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Figure 1 Positions of the altered CBS amino acid 
residues on the 3D structure of the truncated CBS 
protein. The pyridoxine and heme cofactors are shown 
in black. (A) Forty-four amino acid residues representing 
51 of the 58 substitutions are shown in red. Seven of the 
substitutions were in residues not included in the 3D 
structure and are not shown. (B) The fourteen residues 
that, when substituted, resulted in CBS protein variants 
with altered cofactor sensitivity in vivo. All variants dem- 
onstrated altered sensitivity to pyridoxine; the two var- 
iants that had altered sensitivity to both heme and 
pyridoxine (K267E and L345P) are shown in green. 



to construct a new collection of CBS variants, specifically trying to 
enhance moderate functional impacts. The first was a directed 
strategy aimed at protein residues not directly involved in either 
the binding of cofactor or the catalytic pocket, but dispersed 
throughout the protein, that might result in a range of impacts on 
protein function. The primary criteria were that the residue be (1) 
buried within the hydrophobic core (i.e., no solvent accessibility), 
and (2) distal from the PLP cofactor-binding site. Also, catalytic 
residues were excluded because mutations of catalytic sites would 
have clear deleterious effects independent of their effect on the free 
energy of folding. We used Rosetta-based predictions of the ther- 
modynamic impact of single amino acid substitutions as an ap- 
proximate guide to be able to test a presumed range of destabilizing 
mutations. The resulting set of 31 single amino-acid substitutions 
was predicted to have a range of destabilizing impacts, from mild 
to severe. The second strategy was random mutagenesis designed 
to be unbiased with respect to protein residue and substitution 
and resulted in another set of 27 single- amino- acid substitutions. 
In total, 58 single amino acid substitutions were constructed in 
the human CBS gene by these two strategies, and functionally 
assayed in vivo. The complete list of substitutions and their predicted 



impact on the thermodynamic stability of the protein based on the 
Rosetta profile is shown in Table 1. 

The observed growth phenotypes were assigned to one of three 
categories. The first comprised those changes that failed to confer 
growth even at the highest concentration of exogenous pyridoxine 
supplementation (400 ng/mL). These 19 substitutions created a non- 
functional enzyme as defined by the in vivo growth assay. The remain- 
ing two categories together comprised 39 substitutions that were able 
to confer growth to some degree. Their in vivo activities were de- 
termined at four concentrations of exogenous pyridoxine supplemen- 
tation (Table 1). Twenty-five of these substitutions supported growth 
rates comparable to wild type at all four concentrations of pyridoxine 
supplementation, and were defined as benign. The final category of 14 
substitutions conferred a growth rate that was significantly different 
from that of yeast with the major allele of human CBS in at least one 
concentration of pyridoxine supplementation (pair-wise t test; P < 
.01). Although the majority of these substitutions exhibited slower 
growth rates relative to wild type, particularly at low pyridoxine con- 
centrations, three substitutions (1 127V, 1 188V, and V371A) conferred 
better growth than wild type. We refer to this category of 14 substi- 
tutions as being cofactor sensitive. 
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Figure 2 Yeast in vivo growth complementation assay for human CBS function. (A) Scatter plot showing the predicted change in free energy 
(Predicted AAG, x-axis) for 52 human single-amino-acid CBS substitutions vs. their conferred growth rates expressed relative to that of the major 
human CBS allele grown with 400 ng/mL pyridoxine supplementation in HEM1 yeast strains (y-axis). The alleles were categorized by growth 
phenotype as benign, sensitive, or nonfunctional (see Results). (B) Assessment of the calculated changes in free energy to predict the phenotypic 
outcome for 52 human CBS substitutions. The alleles were binned based on the predicted change in free energy (AAG) in 0.5 kcal/mol incre- 
ments. For each incremental threshold a percentage "correct" was calculated by summing the number of alleles growing below the threshold 
added to the number of alleles not growing above the threshold, divided by the total number of alleles (i.e., 52). The dotted line indicates 
a predicted AAG of 3.5 kcal/mol Rosetta units. 
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Table 2 Growth Rates Supported by CBS Variants in a hem 1A Background 



Substituted 400 ng/mL B6 and 50 |xg/mL 400 ng/mL B6 and 5 |xg/ml 2 ng/mL B6 and 50 |xg/ml 2 ng/mL B6 and 5 |xg/ml_ 



Residue 


8-ALA 


8-ALA 


8-ALA 


8-ALA 


V118G 


0.90 ± 0.06 


0.78 ± 0.12 


0.60 ± 0.15 


0.56 ± 0.07 


1142V 


0.82 ± 0.07 


0.74 ± 0.12 


0.69 ± 0.14 


0.87 ± 0.08 


L181A 


0.64 ± 0.06 


0.59 ± 0.03 


0.57 ± 0.09 


0.59 ± 0.08 


N225S 


0.74 ± 0.11 


0.66 ± 0.06 


0.55 ± 0.12 


0.45 ± 0.07 


K267E 


0.61 ± 0.00 


n/d 


0.57 ± 0.18 


n/d 


I286A 


0.64 ± 0.03 


0.58 ± 0.02 


0.46 ± 0.06 


0.34 ± 0.05 


L293V 


0.69 ± 0.07 


0.67 ± 0.10 


0.47 ± 0.07 


0.36 ± 0.06 


L345P 


0.58 ± 0.01 


0.07 ± 0.03 


0.25 ± 0.06 


0.05 ± 0.01 


F385L 


0.77 ± 0.05 


0.69 ± 0.07 


0.58 ± 0.10 


0.54 ± 0.11 



Relative growth rates of a subset of 9 B6-remediable single amino-acid human CBS substitutions expressed in yeast are listed. Each variant was grown in four media 
conditions; 400 ng/mL pyridoxine and 50 |xg/mL d-ALA, 400 ng/mL pyridoxine and 5 |xg/mL d-ALA, 2 ng/mL pyridoxine and 50 |xg/rmL d-ALA, and 2 ng/mL 
pyridoxine and 5 |xg/mL d-ALA. As in Table 1 , growth rates are expressed as a percentage relative to the yeast strain expressing the major allele of human CBS grown 
with the same media. Averages were determined from at least three independent yeast transformants of each CBS variant. Conditions in which a variant was unable to 
grow sufficiently to determine a growth rate are indicated (n/d). CBS, cystathionine ^-synthase. 



The distributed nature of functional classes 
of mutations 

Some alleles led to proteins with decreased yet remediable function. For 
these mutants, their loss or reduction in function could be compen- 
sated by the binding of a cofactor. If the effect of these mutations on 
the enzyme were local to the altered site, remediable alleles would 
be expected to be close in structure to the cofactor-binding site. If, 
however, the effect were more global, such as a general destabilization 
of the protein, then remediable alleles may occur at a considerable 
distance from the cofactor-binding site. 

We used the structure of a truncated version of the major allele 
protein as a framework for exploring the relative distribution of different 
classes of variants. We mapped the positions of the 44 mutated residues 
with positional information (representing 51 mutations, 7 mutations 
were in residues not included in the solved structure) to examine if 
there were any "hot spots" for the phenotypic categories (Figure 1). 
There was no obvious clustering of any phenotypic classes to any 
particular regions. Indeed, the mutated residues were located through- 
out the structure and the different phenotypic categories were inter- 
spersed. The lack of geographic bias added strength to the view that 
the phenotypic categories were the result of distributed influences and 
were not limited to region- specific properties. 

To explore possible associations between physical characteristics of 
substitutions and observed growth phenotype in a more quantitative 



fashion, alleles were grouped based their growth phenotype: non- 
functional, sensitive, or benign; each of these three groups were 
further divided based on the method of molecular construction, 
directed or random (six groups in all). For each category, five physical 
characteristics were determined (distance between the substitution 
and heme, distance between the substitution and PLP, solvent accessi- 
bility, regular secondary structure, and crystal structure B -factor), and 
category averages were calculated. For comparison, an equal number 
of alleles were randomly assigned to each category 10,000 times and 
category averages for the same five measurements performed allowing 
a rigorous statistical test for the probability that the test category is 
higher or lower than random. 

As expected, the directed variants had significantly less solvent 
accessibility than the randomized controls because this was a criterion 
in their selection. This correlation provided a positive control that the 
analysis could detect correlations if they existed among the data. 
Among the randomly generated variants, those with nonfunctional 
or sensitive phenotypes were also less solvent accessible than either 
the controls or the benign random substitutions. However, no other 
physical parameter examined was characteristic of any of the pheno- 
typic categories. The remediable alleles were no closer to either of the 
cofactor binding sites, nor did they have altered occurrence in regular 
secondary structure, or correlate with high or low crystallographic 
B -factor values. 




L181A K267E I286A L293V L345P 



L181A K267E I286A L293V L345P 



Figure 3 CBS variant growth rates at various levels of exogenous cofactor supplementation. (A) Growth rates for six variants grown with four levels 
of pyridoxine supplementation (400, 4, 2, and 1 ng/mL) in HEM 7 yeast strains. Growth rates are expressed relative to yeast with the major human 
CBS allele grown with the same media conditions. (B) Growth rates for five of the same variants grown in hemlA strains, in either a high level of 
pyridoxine (400 ng/mL, left panel) or a low level of pyridoxine (2 ng/mL, right). Additionally, these strains were grown with either high 8-ALA 
supplementation (50 fxg/mL) or low 8-ALA supplementation (5 fxg/mL). The K267M variant was omitted because it did not show any sensitivity to 
pyridoxine levels (A). All error bars indicate the standard deviation of at least three independent transformants. 
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Figure 4 Heme content of purified variant CBS proteins. Photographs of cuvettes containing equal concentrations of each purified protein are 
shown on the left. (A) Absorption spectra demonstrating the level of heme incorporation for seven CBS proteins. Bacterial cells expressing the 
various CBS proteins were supplemented with 0.3 mM 8-ALA in the growth medium. (B) Increased incorporation of heme for the CBS variants 
when expressed with an increased concentration of exogenous 8-ALA in the growth medium (0.3 mM [solid symbols] vs. 0.6 mM [open symbols]). 



Correlations of thermodynamic predictions with 
growth phenotypes 

Although the Rosetta algorithm was used to select mutations with a 
range of predicted effects on stability, the correlation between the 
predicted effects on stability and observed effects on growth was 
poor. The only reliable correlations were for those variants predicted 
to be severely destabilizing (>4 kcal/mol on the Rosetta scale). Nine 
of the 1 1 substitutions in this category resulted in a no-growth, no- 
remediation phenotype (Figure 2A). The two remaining substitu- 
tions (F385L and V354G) both were impaired and sensitive to low 
levels of pyridoxine, in close agreement with a previous study (Wei 
et al. 2010). Thus the 3.5- to 4-kcal/mole AAG prediction marked 
a threshold above which all examined CBS variants were impaired, 
and indeed most were completely nonfunctional (Figure 2B). 

In contrast, the majority of mutations were predicted to cause 
a more modest AAG effect and were selected precisely because of their 
predicted intermediate effects on stability. These variants fell into all 
three growth-phenotype categories (Figure 2A). There were no signif- 
icant correlations between growth phenotype and the predicted 
change in stability for mutations in this range of destabilization. 

Characterization of cofactor remediation 

Cofactor-remediable substitutions occurred at a significant distance 
from the binding site of that cofactor, which is consistent with the 
hypothesis that the effects of remediable alleles were not local, and 
were more likely a result of a modest destabilization of the protein. If 
true, the cofactor sensitivity of a given variant may not be specific to 
one cofactor. That is, interaction with a different cofactor, which also 
binds at a distance from the substitution, may also provide enough 
stabilization to at least partially compensate for the stability lost due to 
the substitution. We examined this question by transforming the first 
nine B6-remediable alleles that we identified into a yeast hemlA strain 
in which we could control the relative level of intracellular heme, the 
other cofactor for CBS, by supplementing with either high or low 
concentrations of 8-amino levulinic acid (8-ALA). 8-ALA is the prod- 
uct of Hemlp and a precursor in heme biosynthesis (Table 2). Two 
alleles, K267E and L345P, which were sensitive to low pyridoxine 
levels, were also extremely sensitive to low 8-ALA concentrations. 



Neither of these substitutions was particularly close to either cofactor- 
binding site; in fact L345P is on the opposite side of the monomer 
from the heme-binding site (Figure 1). Therefore, these two alleles 
encoded proteins that were severely impaired by low concentrations 
of either cofactor, yet could be significantly remediated by high con- 
centrations of either. 

Physical analysis of heme incorporation 

To probe the nature of the remediable alleles, we focused on the 
cofactor sensitivity of a subset of substitutions for in vitro analysis. 
Considering that substitutions predicted to have changes in free en- 
ergy less than 3.5 kcal/mol produced protein variants of all phenotypic 
categories, we selected six variants for this analysis that were predicted 
to have modest destabilizing effects, yet represented three of the phe- 
notypic categories (the nonfunctional alleles were omitted). Of these, 
two variants exhibited heightened sensitivities to levels of both cofac- 
tors, pyridoxine and heme (K267E and L345P), an additional three 
variants showed sensitivity to reduced pyridoxine levels but not heme 
levels (L181A, I286A, and L293V), and the final variant selected was 
benign (K267M; Figure 3, A and B). 

For the in vitro studies, the full-length CBS coding region was 
fused to that of the maltose binding protein (MBP), which facilitated 
the solubility of bacterially expressed CBS variants. To aid in the in- 
tracellular production of heme and its subsequent incorporation into 
the CBS protein during expression, the bacterial growth medium was 
supplemented with 8-ALA (Kery et al. 1994). This MBP-CBS fusion 
protein was soluble when expressed in bacteria and could be readily 
purified and assayed. The purification yield varied significantly among 
the seven selected proteins (major allele and six variants) and corre- 
lated with the intensity of a yellow/brown color of the purified protein 
(Figure 4). This observation suggested that the amount of heme asso- 
ciated with the purified variants also varied (Kery et al. 1994). Indeed, 
the amount of protein-bound heme, measured spectrophotometrically, 
varied significantly among the protein variants and correlated with the 
observed color (Figure 4A) and with activity in vivo as measured by 
growth rate. 

The observed differences in heme sensitivity in vivo and heme 
incorporation in vitro suggested both that this collection of proteins 
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varied in their ability to bind heme, and that changes in the availability 
of heme influenced its incorporation, and subsequent enzymatic func- 
tion. For example, K267E and L345P, which were most sensitive to 
heme availability in vivo, incorporated the least heme, as measured 
in vitro. Several of these variants, including the major allele, were ex- 
pressed in cells grown at two concentrations of 8 -ALA, purified and 
evaluated for heme incorporation (Figure 4B). Increased heme avail- 
ability resulted in marginal gains in the low level of heme incorporated 
into L345P and no detectable increase in K267E, consistent with the 
extreme sensitivity of these two variants to lowered levels of heme. In 
contrast, L293V, I286A and the major allele form of CBS had more 
associated heme under these conditions, presumably reflecting a 
greater fraction of CBS molecules that had incorporated heme, and 
supported greater growth. 

Physical analysis of protein stability 

One method to interrogate the stability and dynamics of a purified 
protein is to examine its sensitivity to proteases (Park and Marqusee 
2006). The differences in in vivo function, heme content in the puri- 
fied proteins, and their altered sensitivity to pyridoxine levels collec- 
tively implied that the alleles described in Figure 4 likely resulted 
in a conformational perturbation of the protein. Therefore, we 
examined their relative sensitivity to thermolysin cleavage. The 
thermolysin-sensitivity of the major allele full-length CBS and variants 
has been used previously to detect whether substitutions affected the 
conformational stability for other variants of CBS (Hnizda et al. 2012a). 

Under the conditions used here, the MBP fragment was quickly 
cleaved from the full-length CBS domain, even at the lowest protease 
levels tested. The remaining full-length major-allele CBS remained 
largely resistant to further degradation by thermolysin treatment for 
20 min (Figure 5). Longer protease time points eventually degraded 
the full-length CBS (not shown). The sensitivity to thermolysin di- 
gestion varied significantly among the alleles. Importantly, the degree 
of sensitivity correlated with heme content. The less heme incorpo- 
rated in the protein, the more sensitive the CBS variant was to pro- 
tease digestion (Figure 6). These results implied that heme-binding 
stabilized the protein to proteolysis. 

DISCUSSION 

In this study we interrogated the relationship between protein structure 
and cofactor-level sensitivity of mutant variants of human CBS, a B6- 
dependent enzyme with a key role in metabolism. Importantly, cofactor- 
remediable variants occurred at positions dispersed throughout the 
structure of the protein. The dually remediable variants, responsive to 
both B6 and heme cofactors, were striking in that the changes in these 
variants were not proximal to the binding site of either cofactor. The dual 
B6- and heme-level sensitivities of two variants in particular raised the 
possibility that iron deficiency, or other contributions to heme function, 
might also impact enzyme function in humans carrying these alleles. 

Five of the six single amino acid substitutions of CBS selected for 
physical analysis exhibited altered sensitivity to the B6 cofactor in vivo 
(the sixth, functionally benign K267M variant, was included in this set 
as a control), but displayed differing phenotypes with respect to heme- 
level sensitivity. Three variants, (L181A, I286A, and L293V) exhibited 
sensitivity to reduced pyridoxine levels relative to the major allele, but 
no heightened sensitivity to reduced heme levels based upon growth- 
rate measurements. However, there was a range of heme incorporation 
among these alleles in vitro. The L181A variant appeared to incorporate 
a level of heme similar to the protein encoded by the major allele, 
although the other two CBS variants incorporated less heme. These 
variants established that heightened B6-level sensitivity was not nec- 
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Figure 5 Protease digestion of the purified major allele MBP-CBS 
fusion protein. After treatment with thermolysin for the indicated time 
(minutes), the digested protein is separated by polyacrylamide gel 
electrophoresis. The full-length CBS protein (with MBP removed) runs 
as a doublet; the identities of all of the labeled bands were confirmed 
by Mass-Spectrometry analysis, lota marks the lane with no protease. 

essarily coupled with discernable effects on heme loading or confor- 
mational stability, and suggested that there may be a threshold with 
respect to heme loading and protein function in vivo. 

Perhaps the most interesting alleles were those that encode K267E 
and L345P, which exhibited the most extreme in vivo sensitivity to 
reduced levels of either pyridoxine or heme. Upon purification both 
variant proteins bound far less heme than wild-type CBS, and ex- 
hibited the most enhanced thermolysin sensitivity of the seven CBS 
versions tested. Heme incorporation into CBS protein appears to be 
obligatorily co-translational, and reduced heme binding in turn affects 
PLP binding (Kery et al. 1994; Oliveriusova et al. 2002). Thus variants 
with reduced heme binding would be expected to alter PLP binding 
and yield a protein with reduced function, as was observed for these 
two alleles. But at a fixed level of heme availability, and presumably 
therefore at a fixed level of heme incorporation, these two variants also 
showed altered activity to pyridoxine concentrations in vivo. It would 
appear that variants with heightened sensitivity to the levels of two 
cofactors commonly have heightened conformational flexibility, as 
measured by protease sensitivity, relative to variants with more spe- 
cific sensitivities. 

Taken to a logical conclusion, these striking examples raise the 
possibility that a small molecule ligand that binds to one position of 
a protein can suppress the impact of a destabilizing mutation at 
a distant site. This notion is consistent with the observation that the 
physical impacts of a number of clinical CBS alleles result in subtle 
alterations of protein structure (Hnizda et al. 2012b) and that small 
molecule chaperones can rescue function (Kopecka et al. 2011; Majtan 
et al. 2010). If so, small molecules that restore function to variant 
proteins could have significant translational importance. Moreover, 
there is growing recognition of cryptic binding sites for small mole- 
cules that are obscured in crystal structures but revealed in molecular 
dynamic simulations (Bowman and Geissler 2012). 

Clearly, experimental evaluation of all amino acid substitutions in 
the human proteome would be technically daunting. Hence robust 
computational prediction of the impact of variation would be desirable 
if personalized medicine is to reach its full potential. However, the 
results described here demonstrate that single amino acid substitutions 
that are calculated to have a modest impact on protein stability can 
nevertheless impact protein function, ranging from nonfunctional to 
benign. The needed improvements in predictive methods rely on 
datasets that interrogate the impact of many alleles of a protein. The 58 
alleles of CBS described here, combined with previous studies, offer an 
increasingly useful resource for such developments. 
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Figure 6 Protease digestion of seven purified M BP-CBS protein 
alleles. The conditions of the thermolysin digests are as in Figure 5, 
and the order of proteins from the top is that of heme incorporation, 
from highest to lowest. In each case the relevant doublet band of full- 
length CBS is indicated by the arrow head. The lower portion of the 
gel showing the cleaved MBP band is not shown. 

ACKNOWLEDGMENTS 

We thank Warren Kruger for the generous gift of the plasmid pHUCBS. 
Thanks to David Baker, Philip Bradley, and Elizabeth Kellogg for aid in 
the selection of the CBS variants and generating the Rosetta Predictions. 
The Macrolab at UC Berkeley constructed the 6His-MBP-CBS bacterial 
expression clone. We thank Lori Kohlstaedt of the QB3 Proteomics/ 
Mass Spectrometry laboratory for confirming the identity of the protein 
bands. We also thank all of the members of the Rine and Marqusee labs 
for their support during the course of this work. This work was 
supported by U.S. Army Research Office grants W911NF-06-01-0166, 
W911NF-09-1-0211 and W91 INF- 10- 1-0496 (J.R.), National Institutes 
of Health (NIH) grant GM-072859 (J.R.), and NIH GM-50945 (S.M.). 

LITERATURE CITED 

Abecasis, G. R., A. Auton, L. D. Brooks, M. A. DePristo, R. M. Durbin et al, 
2012 An integrated map of genetic variation from 1,092 human ge- 
nomes. Nature 491: 56-65. 

Ames, B. N., I. Elson-Schwab, and E. A. Silver, 2002 High-dose vitamin 
therapy stimulates variant enzymes with decreased coenzyme binding 
affinity (increased K(m)): relevance to genetic disease and polymor- 
phisms. Am. J. Clin. Nutr. 75: 616-658. 

Bonneau, R, I. Ruczinski, J. Tsai, and D. Baker, 2002 Contact order and ab 
initio protein structure prediction. Protein Sci. 11: 1937-1944. 



Bowman, G. R, and P. L. Geissler, 2012 Equilibrium fluctuations of a single 
folded protein reveal a multitude of potential cryptic allosteric sites. Proc. 
Natl. Acad. Sci. USA 109: 11681-11686. 

Das, R, and D. Baker, 2008 Macromolecular modeling with Rosetta. Annu. 
Rev. Biochem. 77: 363-382. 

Edelhoch, H., 1967 Spectroscopic determination of tryptophan and tyrosine 
in proteins. Biochemistry 6: 1948-1954. 

Eng, J. K., A. L. MacCormak, and J. R. Yates, 3rd, 1994 An approach to 
correlate tandem mass spectral data of peptides with amino acid se- 
quences in a protein database. J. Am. Soc. Mass Spectrom. 5: 976-989. 

Guirard, B. M., B. N. Ames, and E. E. Snell, 1971 Salmonella typhimurium 
mutants with alternate requirements for vitamin B 6 or isoleucine. J. 
Bacteriol. 108: 359-363. 

Hnizda, A., V. Spiwok, V. Jurga, V. Kozich, M. Kodicek et al, 2010 Cross- 
talk between the catalytic core and the regulatory domain in cystathionine 
beta- synthase: study by differential covalent labeling and computational 
modeling. Biochemistry 49: 10526-10534. 

Hnizda, A., T. Majtan, L. Liu, A. L. Pey, J. F. Carpenter et al, 

2012a Conformational properties of nine purified cystathionine beta- 
synthase mutants. Biochemistry 51: 4755-4763. 

Hnizda, A., T. Majtan, L. Liu, A. L. Pey, J. F. Carpenter et al, 

2012b Correction to conformational properties of nine purified cysta- 
thionine beta-synthase mutants. Biochemistry 51: 5540. 

Ito, H., Y. Fukuda, K. Murata, and A. Kimura, 1983 Transformation of 
intact yeast cells treated with alkali cations. J. Bacteriol. 153: 163-168. 

Kellogg, E. H., A. Leaver-Fay, and D. Baker, 2011 Role of conformational 
sampling in computing mutation-induced changes in protein structure 
and stability. Proteins 79: 830-838. 

Kery, V., G. Bukovska, and J. P. Kraus, 1994 Transsulfuration depends on 
heme in addition to pyridoxal 5 '-phosphate. Cystathionine beta-synthase 
is a heme protein. J. Biol. Chem. 269: 25283-25288. 

Kim, C. E., P. M. Gallagher, A. B. Guttormsen, H. Refsum, P. M. Ueland 
et al, 1997 Functional modeling of vitamin responsiveness in yeast: 
a common pyridoxine-responsive cystathionine beta-synthase mutation 
in homocystinuria. Hum. Mol. Genet. 6: 2213-2221. 

Kopecka, J., J. Krijt, K. Rakova, and V. Kozich, 2011 Restoring assembly 
and activity of cystathionine beta-synthase mutants by ligands and 
chemical chaperones. J. Inherit. Metab. Dis. 34: 39-48. 

Kraus, J. P., M. Janosik, V. Kozich, R. Mandell, V. Shih et al, 

1999 Cystathionine beta-synthase mutations in homocystinuria. Hum. 
Mutat. 13: 362-375. 

Kraus, J. P., V. Kozich, and M. Janosik, 2013 CBS Mutation Database. 
Available at: http://cbs.lfl.cuni.cz/index.php. Accessed: August 5, 2013. 

Kruger, W. D., and D. R. Cox, 1994 A yeast system for expression of human 
cystathionine beta-synthase: structural and functional conservation of the 
human and yeast genes. Proc. Natl. Acad. Sci. USA 91: 6614-6618. 

Kruger, W. D., and D. R. Cox, 1995 A yeast assay for functional detection of 
mutations in the human cystathionine beta-synthase gene. Hum. Mol. 
Genet. 4: 1155-1161. 

Levy, S., G. Sutton, P. C. Ng, L. Feuk, A. L. Halpern et al, 2007 The diploid 
genome sequence of an individual human. PLoS Biol. 5: e254. 

Majtan, T., L. Liu, J. F. Carpenter, and J. P. Kraus, 2010 Rescue of cysta- 
thionine beta-synthase (CBS) mutants with chemical chaperones: purifi- 
cation and characterization of eight CBS mutant enzymes. J. Biol. Chem. 
285: 15866-15873. 

Marini, N. J., J. Gin, J. Ziegle, K. H. Keho, D. Ginzinger et al, 2008 The 
prevalence of folate-remedial MTHFR enzyme variants in humans. Proc. 
Natl. Acad. Sci. USA 105: 8055-8060. 

Marini, N. J., P. D. Thomas, and J. Rine, 2010 The use of orthologous 
sequences to predict the impact of amino acid substitutions on protein 
function. PLoS Genet. 6: el000968. 

Mayfield, J. A., M. W. Davies, D. Dimster-Denk, N. Pleskac, S. McCarthy 
et al, 2012 Surrogate genetics and metabolic profiling for character- 
ization of human disease alleles. Genetics 190: 1309-1323. 

Meier, M., M. Janosik, V. Kery, J. P. Kraus, and P. Burkhard, 2001 Structure 
of human cystathionine beta-synthase: a unique pyridoxal 5 '-phosphate- 
dependent heme protein. EMBO J. 20: 3910-3916. 



G3'Genes | Genomes | Genetics 



Volume 3 October 2013 I Cofactor Dependence of Human CBS Protein I 1627 



Meier, M., J. Oliveriusova, J. P. Kraus, and P. Burkhard, 2003 Structural 
insights into mutations of cystathionine beta- synthase. Biochim. Biophys. 
Acta 1647: 206-213. 

Mumberg, D., R. Muller, and M. Funk, 1995 Yeast vectors for the con- 
trolled expression of heterologous proteins in different genetic back- 
grounds. Gene 156: 119-122. 

Oliveriusova, J., V. Kery, K. N. Maclean, and J. P. Kraus, 2002 Deletion muta- 
genesis of human cystathionine beta-synthase. Impact on activity, oligomeric 
status, and S-adenosylmethionine regulation. J. Biol. Chem. 277: 48386-48394. 

Park, C, and S. Marqusee, 2006 Quantitative determination of protein 
stability and ligand binding by pulse proteolysis. Curr. Pro toe. Protein 
Sci. Chapter 20: Unit 20-11. Wiley, New York. 

Rohl, C. A., C. E. Strauss, K. M. Misura, and D. Baker, 2004 Protein 
structure prediction using Rosetta. Methods Enzymol. 383: 66-93. 



Tabb, D. L., W. H. McDonald, and J. R. Yates, 3rd, 2002 DTASelect and 
contrast: tools for assembling and comparing protein identifications from 
shotgun proteomics. J. Proteome Res. 1: 21-26. 

Tennessen, J. A., A. W. Bigham, T. D. O'Connor, W. Fu, E. E. Kenny et at, 
2012 Evolution and functional impact of rare coding variation from 
deep sequencing of human exomes. Science 337: 64-69. 

Wei, Q., L. Wang, Q. Wang, W. D. Kruger, and R. L. Dunbrack, Jr., 

2010 Testing computational prediction of missense mutation pheno- 
types: functional characterization of 204 mutations of human cystathio- 
nine beta synthase. Proteins 78: 2058-2074. 

Yue, P., Z. Li, and J. Moult, 2005 Loss of protein structure stability as 
a major causative factor in monogenic disease. J. Mol. Biol. 353: 459-473. 

Communicating editor: F. P. Roth 



1628 I D. Dimster-Denk et a/. 



:~-ZLG3' Genes | Genomes | Genetics 



